Using Prosodic and Spectral Features in Detecting Depression in Elderly Males
نویسندگان
چکیده
As research in speech processing has matured, there has been much interest in paralinguistic speech processing problems including the speaker’s mental and psychological health. In this study, we focus on speech features that can identify the speaker’s emotional health, i.e., whether the speaker is depressed or not. We use prosodic speech measurements, such as pitch and energy, in addition to spectral features, such as formants and spectral tilt, and compute statistics of these features over different regions of the speech signal. These statistics are used as input features to a discriminative classifier that predicts the speaker’s depression state. We find that with an N -fold leave-one-out cross-validation setup, we can achieve a prediction accuracy of 81.3%, where random guess is 50%.
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملDetecting Depression in Elderly People by Using Artificial Neural Network
Introduction: The possibility of depression is common in the elderly. Novel technologies allow us to monitor people related to depression. Hence, a model was provided to detect depression in elderly based on artificial neural network (ANN). Methods: The present study is an applied descriptive-survey research. Forty elderly people were randomly selected from the Elderly Care Center in Gonbad Ka...
متن کاملClassification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملSpeech emotion recognition using nonlinear dynamics features
Recent developments in man–machine interaction have motivated researchers to recognize human emotion from speech signals. In this study, we propose using nonlinear dynamics features (NLDs) for emotion recognition. NLDs are extracted from the geometrical properties of the reconstructed phase space of speech signals. The traditional prosodic and spectral features are also used as a benchmark. The...
متن کاملA New Dictionary Construction Method in Sparse Representation Techniques for Target Detection in Hyperspectral Imagery
Hyperspectral data in Remote Sensing which have been gathered with efficient spectral resolution (about 10 nanometer) contain a plethora of spectral bands (roughly 200 bands). Since precious information about the spectral features of target materials can be extracted from these data, they have been used exclusively in hyperspectral target detection. One of the problem associated with the detect...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011